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Patterns in Primes 


Our 17th contest appeared in our issue number 65, 
July 1978. The following strings of consecutive prime 
numbers were given: 


5651 1146791 
5653 1146793 
56577. 1146797 
5659 1146799 
5669 1146809 
5683 1146823 
5689 1146829 
5693 1146833 
5701 1146841 


Both of these strings have the same pattern of differences; 
namely, 2, 4, 2, 10, 14, 6, 4, 8. The contest called 
for the longest strings of consecutive prime numbers 
having the same difference pattern. 


The winning result--actually, two of them--comes from 
Sam Wagner, Bluffton, Ohio, shown here and on the cover of 
this issue. We believe that it is unlikely that longer 
lists will ever be found. 
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Mr. Wagner calculated all the differences between 
successive values of the first 6,016,001 primes, and 
assumed that each of these differences could be contained 
in 15 bits. 


[actually, up to the 6,000,000th prime, the largest 
such difference is 220, which is nicely contained in 
only 8 bits | 


The computing was done on a Data General Eclipse 
system with a 96 megabyte disk, using Fortran. 


All the differences except one are even numbers, go 
it 1s expedient to store the half differences. Differences 
over 15 were changed to zero, so that strings of twelve 
half-differences could be expressed in 48 bits for ease of 
manipulation. Successive sifting reduced the file of 
differences to eligible strings that finally yielded the 
two sequences shown. Mr. Wagner is richer by a TI-58 
programmable calculator, and we are richer by an interesting 
piece of knowledge. 
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The daily paper carries crossword puzzles like the 
ones shown here. Each puzzle consists of a 15 x 15 grid 
of squares, with some of the 225 squares blacked out. The 
number of blacked out squares varies from 28 to 42 and 
averages 34, In the last decade there have been over 3000 
of these puzzles published, and it is rare to have any pattern 
of the black squares repeat. It does happen, however. 


Picture yourself as having those 3000 puzzles at hand. 
We want to find all those puzzles having the same pattern 
(two or more times), including rotations and mirror images. 


Your problem is this: Is this a computer problem? 


If you conclude that it is, then outline the steps in 
your proposed computer solution. If you conclude that it is 
not, then outline your alternative method. The game we are 
playing is this: we do want to solve the problem (that is, 
don't waste time debating the usefulness of the project) and 
we do want to minimize our work, the costs, and/or the elapsed 
time to a solution--the question is, what is the best way 
to go about it? 


We can list some attacks on this problem that won't 
work: 


(a) We can't call the newspaper to ask them. The 
daily puzzies are syndicated and arrive by mail at the 
newspaper office. Moreover, that wasn't the problem; we 
want to find the duplicates, not just know that they're there. 


(b) If we could track down the person who composes 
the puzzles, it is doubtful if he or she would know, either. 
Why should he? 


(c) No, there is no optical scanner for sale some- 
where that will read black squares on a puzzle and put that 
information into machine-readable form. Our primary 
entry medium to the available computer is still the 
punched card. 


OOU 


The Crossword Puzzle Problea 


The problems that have appeared in our Homework 
series have been used in computing classes; they are time- 
tested written assignments. The Crossword Puzzle Problem 
on the preceeding pages is different--the "homework" is 
intended to be time spent on thinking over the economics 
of a solution. As it turns out, it would be foolish to 
use &@ computer on the problem, but it usually lures 
beginning students into devising elaborate and complex 
computer solutions. The problem can provoke an hour or 
more of healthy discussion, which is its intent. 
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Most beginners will plunge in with something like: 
"We could compare the elements in the first row of one of 
the patterns..." only to be dismayed at the thought that 
before any comparing can be done, the data to be compared 
must somehow be entered into the machine, a small point 
that they have overlooked. 


So if a computer is to be used, the critical problem 
concerns data representation and data entry, and such @ 
things as the cost of keystroking and how keystroking can 
be validated. 


Regarding the cost of keystroking: in any large 
city, you can buy about 9000 keystrokes per hour, for 
about. $4.50, with assurance of no more than two errors. 
(A competent keypunch operator will make many mistakes 
each hour, but will recognize that fact and correct each 
error, No more than two errors will survive per hour.) 
If you would like to have your work correct, then you will 
have it keyed twice and have the two decks of cards compared 
by machine. You are now getting effectively 10 keystrokes 
per penny (with the probability of 4 errors per 81,000,000). 


Students will invent elaborate schemes for digitizing 
the patterns of black squares, in order to present alpha- 
meric data to the keypunchers. The simplest such scheme 
is to punch zeros for the blank squares and ones for the 
black squares, and use up 225 keystrokes for each pattern. 
Each pattern will thus consume three complete IBM cards, 
with 225 columns of data and 5 columns on each card to 
identify the pattern and the card number (we learned a long @ 
time ago never to deal with unidentified data in our 
business). For the 3000 puzzles, this figures out to 
18000 fully punched cards, costing about $720 for the QQ 
punching and about $40 for the cards themselves. 


Eventually some alert student finds a shortcut to 
all this. Why punch 225 characters per puzzle? The 
patterns all exhibit symmetry of some sort, so that each 
pattern is uniquely defined by half of its black squares, 
or perhaps even by one-quarter of them. 


And now a class will begin to zero in on an efficient 
solution to the problem. Of the six patterns given with 
the assignment, no two are alike. How long did it take 
you to verify that? Patterns T, Q, and S all have the upper 
left square blacked out, but a glance at the next blacked 
out square of each pattern quickly eliminates any possibility 
of duplication. Similarly, patterns P and U have black 
squares starting in the 5th column, but the string of black 


squares is 3 long for pattern P and only 2 long for pattern U. 


Given the 3000 puzzles, it would take perhaps an hour 
to arrange them in no more than 8 stacks, depending on the 
position of the first black square. Each of these stacks 
can then be similarly sorted into substacks depending on 
the position of the second black square, and so on. All 
the duplicates in the 3000 puzzles can be located in this 
way in two to three hours, by one person, by hand. 


For an actual collection of 3000 such puzzles, the 
following is the state of affairs by actual count: 


One case of 6 alike. 

One case of 5 alike. 

Two cases of 4 alike. 

Six cases of 3 alike. 

Thirty cases of 2 alike. 
Eight cases of mirror-images. 


2887 unique patterns. 


Oe a 
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BASIC: An Introduction To Computer Programming 


by Robert J. Bent and George C. Sethares 
Brooks/Cole 1978, 8 1/2 x 11 softcover, $9.95 


...Stresses the techniques and programming 
principles useful in solving problems (in 
any language)....each chapter addresses a 
new problem-solving process. The 
progression of topics is related to 
semantics, that is, what the language can do, 
not according to syntax....flowcharting is 
introduced just after the IF statement, 
Since any introduction before this would be 
Seemingly contrived...By page 19 of the text 
the student can write and run a computer 
program!{4 


---from the brochure's hyperbole 


The title of this book places it in a category of 
textbooks which does not suffer from overcrowding; namely, @ 
the set of “language"-books that also attempt to teach 
the rudiments of computing. 


What are the aspects of such a book that could be 
done badly? Let's list some of them in priority order, 
worst first: 


1. The "facts" about the language involved are 
wrong. 


2. Various essential aspects of the language are 
omitted. 


3. The book fosters poor computing practices. 
4, Programs given in the book as models are wrong. 


5. Programs given in the book as models are poorly 
done (e.g., spaghetti programs). 


6. Examples and exercises are poorly chosen. 

7. The writing is bad. 

The above list, although not comprehensive, can @ 
serve to calibrate a new textbook. Let's apply it, with 
some charity,- to the Bent and Sethares text. 


An early exercise in the book (page 47) and its 
accompanying flowchart (Figure A) is given here: 


LET P = P+l 


Figure A. Flowchart for 


Example 11 of the BASIC text. 
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Example 11. A list of numbers is to be typed at the terminal to 
determine whether there are more positive numbers or more negative 

numbers in the list. Appropriate messages should be printed. 

Prepare a flowchart for this task. 8 


Problem analysis: Let's use the variable name A fpr the 
number being typed at the terminal. Since the length of 
the list is not specified, it must be assumed that lists 
of any length may be typed in. One way to allow for 
variable-length lists is to require the user to type a 
special value for A after the entire list has been entered. 
Of course, the computer must be programmed to recognize 
this number. For this example the number 9999 will be 
used and an IF statement with the condition A = 9999 

will be used to recognize it. 

To determine whether the list contains more positive 
or more negative numbers, we will use the variable P to 
count the positive numbers and N to count the negative 
numbers. After the entire list has been entered, these 
two values will be compared. 


Also shown here (Figure B) is another flowchart for 
the same problem, It is suggested that Figure A, shown 
as a model to beginning students, fosters poor computing 
(among other things that one might object to, its logic 
flounders around) and that Figure B represents a start 
toward good computing practices. 


We have tons of literature (whole books, in fact) on 
the symbols to be used on flowcharts, but almost nothing 
on what to put in the little boxes. Figure A is a fine 
example. Does the decision box--the diamond--imply a 
Question mark, or would it be better to use English 
sentences and include the question mark? That is, is 


good English, or not? Would an insistence on proper 
English help to foster good computing? For that matter, 
which of the following is a better indicator of clear 
thinking? : 


8 > 
A = 9999 —<$— A:9999 —— 
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= POSITIVE 


Figure B,. 


A different flowchart 
for Example 11. 


What will the student learn, when he sees a model 
flowchart in which A is compared to zero twice (needlessly) 
and P is compared to N twice (needlessly)? The logic 
in Figure B is at least as clear, is much shorter, and is 
arranged so that parallel ideas appear in parallel. The 
logic of Figure A is the sort of thing that might be 
developed in class on the blackboard, but the end result 
shown in a textbook should do better than that. 


The whole case for or against this book could rest 
on the fine points between Figures A and B, but we can 
look further. 
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On page 61, the reader is shown the logic of testing 
a number N for primality (a favorite exercise in beginning 
texts these days). It is pointed out, quite properly, 
that possible divisors of the number to be tested need not 
exceed the square root of N, but this logic appears on 
the flowchart as: 


yes 


instead 


of a 
this: 


no s 


--which is clearer?; which is more elegant?; which 


represents better computing? Even in BASIC, where square 
root is a built-in function, it is still cheaper to multiply 
than to invoke square root. Does it matter? Well, 


what is it that we are trying to teach? 


Up to page 88 (the book has 239 pages), the authors 
tend to follow the ANSI standards of flowcharting, using 
their own interpretation of those standards, as illustrated 
by Figure A. Beginning on page 88 they shift to the 
Michigan style, which features this symbol: 


yes 


no 


We have precious few agreed-upon standards in computing, 
but one that we do have is a standard set of flowchart 
symbols. If, given such a standard, everyone goes his 
own way, we tend toward chaos. It is irrelevant to argue 
that some non-standard notation is better, or that it 
represents how things "should be." It took 22 years just 
to get agreement that a flowchart decision should be a 
diamond. It is now more than 30 years in which we have not 
agreed as to whether to slash the letter or the digit. 
Beginning students (to whom this book is aimed) are not 
served by the use of parochial and oddball notation. 


In the chapter on sorting, the authors begin with 
bubble sorting--but it is the version of bubble sorting 
in which K(K+1)/2 comparisons, and possible interchanges, 
will be made regardless of the initial ordering of the K 
items being sorted. Again, they are fostering less than 
good computing. Much can be learned from a thorough 
discussion of the various versions of bubble sorting 
(particularly the version that capitalizes on any ordering 
that already exists in the data). 
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All the above criticisms might be debatable; there 
are points of view involved, and fundamental differences 
in beliefs about what constitutes good computing and how 
to teach it. What really triggered a lengthy review of 
a rather prosaic (but, on the whole, well-done) text was 
the following. 


In the chapter on random numbers, there are the 
following exercises: 


5. Simulate tossing three coins ten times. 
The output should be a list of terms such as 
HHH, HTH, HHT, and so on. 


10. Generate a list A of 100 different numbers. 


Any such list will do. For example, A(1)=1, 
A(2)=2, and so on. Write programs to do the 
following. 


a. Create a list B of approximately 20 
different numbers selected randomly from list A. 

b. Randomly select one number from list 
A, and then randomly select another from the 
remaining 99. 


c. Randomly select exactly 20 different 
numbers from list A. 


16. Two knights begin at diagonally opposite 
corners of a chessboard and travel randomly 
about the board but always making legitimate 
knight moves. Calculate the number of moves 
before one knight captures the other. However 
you number the squares, each Inight's move 
should be printed as it is taken. 


Problem 5 is a traditional finger exercise, to get the 
student started using the new function. Problem 10 is 
somewhat more challenging, and by the time a student gets 
to part (c), he will welcome eagerly any suggestions from 
an experienced teacher as to an efficient way to go about 
the assigned task. 
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But then we have problem 16, which is in a different 
league entirely. We will discuss this problem at some 
length. The point to be made is that the three exercises 
are several orders of magnitude apart in difficulty, and 
the student should be informed of this (even high school 
algebra books mark the difficult problems with a star) and 
be given some help to get started. 


One of the first problems to be solved in the problem 
of the two knights is how to represent, in numbers, the 
board position. Figure C shows a possible scheme, with 
markers in the starting position for the normal size of 
chess board (8 x 8). At this point, we are already 
planning ahead to a generalized version of the problen, 
with the board size varying from 4 x 4 up to 14 x 14, 


4 ah L fn 
12 
11 
[ge aa se fi =  ] 
es SA 
9 
8 
+1 eo 
7 
| Figure C. A numbering scheme for the 
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Figure D shows a broad-brush flowchart of a possible 
solution to the two-knights problem. A beginning student 
should be made aware that such a flowchart does not emerge 
full-blown; that the one he sees (all pretty and neat) is 
the cleaned-up version of several false starts, blind 
alleys, and illogical messes. 


For example, the generalization (to initialize the 
values for the coordinates of knight A to the limits of 
the board) would probably not occur to a programmer right 
away. Also, it might not occur to even an experienced 
programmer that the box labelled (*) on the the flowchart 
of Figure D is redundant--the same result will obtain by 
changing the other box to N+ 2-—>N (each time a knight 
moves on a chessboard, he moves from a black square to a 
white square or vice versa). 


The first attempt at flowcharting a solution to this 
problem had all the equality tests inserted at the point 
marked (*), thus wasting gobs of machine time on useless 
tests, until it became evident that a coincident position 
could result only on an even move. Thus we learn. 


From the broad-brush view to the nuts and bolts of 


a workable code is the next big step (probably requiring the 
drawing of several low-level flowcharts along the way). 


Subroutine 


1: x-coordinate = 2 
y-coordinate = -1l 
Subroutine 2: x-coordinate = 1 
y-coordinate = -2 Figure E. 
Subroutine 3: x-coordinate = -2 A possible scheme 
y-coordinate = -l for the eight 
possible moves of 
Subroutine 4: x-coordinate = -1 a chess knight. 


y-coordinate = -2 


Subroutine 5: x-coordinate = 2 
y-coordinate = 1 


Subroutine 6: x-coordinate = 1 
y-coordinate = @2 


Subroutine 7: x-coordinate 
y-coordinate 


= 


Subroutine 8: x-coordinate = -1 
y-coordinate = 


Average 


Board Number number Standard 
size of cases of moves deviation 
4 63 10.22 13.17 
5 53 36.30 32 .97 
6 54 65.44 SNS) lal, 
f 74 47 .00 37 .62 
8 5 68 .06 65.52 
9 74 108 .57 95.91 
10 55 122.84 100.34 
11 75 125.84 112.51 
12 35 144.17 112.32 
13 54 219.81 193.80 
14 25 294.16 128 . 36 
Some results for the two-knight 
problem. 
Figure F. 


For example, the logical operation at Reference 2 is 


"Select move at random for knight A." This could be done 
in many ways. One possible way is the following. Set 
up 8 subroutines as in Figure E. Use whatever random 


number generator you have (e.g., the RND function that is 
built into your BASIC, although that might be the worst 
choice) to select a random integer between 1 and 8; name 
_this integer R. Then call subroutine R. The values 
of x and y thus produced can be applied to (a,c) or (b,d) 
in order to effect one of the 8 possible legal moves that 
a chess knight can make. 


For the 8 x 8 board, using the numbering scheme of 
Figure C, the result of applying a legal move to either 
knight A or knight B must meet the conditions that the 
coordinates a, b, c, or d must be less than or equal to 10 
and greater than or equal to 3, or else the "legal" move 
has taken us off the board and a different move must pe 
selected, again at random. 


Figure F shows some results for various sized chess 
boards. The result for the 7 x.7 board seems questionable, 
but at least we have an answer for Problem 16 of the book. 
It is possible that there is a fundamental difference in 
the random action of the two knights, depending on whether 
the board has an odd number of cells on a side or an even 
number of cells--thus, as always, the end result of this 
tiny bit of research is an indication that more research 
is needed. 


EJ 
O 
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Lo! the poor Indian! ...Alexander Pope 
There is a legend that says that Manhattan island - @ 
was purchased from the Indians in 1627 for $24, If 


that $24 had been invested at 3%, compounded annually, 
how much would it amount to in 1978? 


769267 .019033525979167457 58082995 5697417 3827 3679 358079789898 3 
504097 369 3683217414 34894160698 3612 3842911048754 3381589807415 
5194768879544915 332447 30025662307 253954960616566118347 546898 
89826957867 5488106584 647 33188657 3370680646117436205971280077 
51421079969194 398769748508 16 386145 344217 45486688 32170 38454904 
42647685 3301169661020 3221178690092021087 5422691897 3149123706 
95767993877486502746608 31578012968 364440 300920507 270181057 33 
457 361344095469680948053327 239502 39069977 598599 3928980184248 
845625541247659804676670151319400769514241590506460782667 393 
2151012513852 36246017413128965757 1922854556282 389087 74613967 & 
53685226948707025460 37592807 50948569624 35040017 5025965382161 
800947 54526 35724722591 310690 3620676 3815296063528 


The figures above are the result, as calculated step- 
by-step, retaining every digit at every step. On the facing 
page are more results for this problem. All these numbers 
are answers to the same well-defined problem, and they 
are all correct. 


We are conditioned to the belief that there 1s always 
just one answer to a well-defined problem (that is, after the 
first shock of discovering that our answer of 53/3 to the 
problem in our arithmetic book didn't quite agree with the 
back of the book, where it was given as 17 2/3, or maybe 
as 17.67). So how can there be ten different answers, all 
correct, to the same problem? 


The long calculation above can be checked, somewhat. The 
number of digits (708) checks with logarithms. The low-order digit 
cycles 4, 2, 6, 8, 4, 2, 6, 8 through the solution; the low-order two 
digits have a cycle of 20; the low-order three digits have a cycle 
of 100. Thus, some small calculation can verify that the 528 at 
the end of the number is correct. 


Calculated in fixed point arithmetic, 
with each year's result rounded to 
the nearest cent. 


Calculated in fixed point arithmetic, 
with each year's result truncated 
to the nearest cent. 


Calculated in fixed point arithmetic, 
with each year's result truncated 
to the nearest 1/10 cent. 


Calculated in floating point arithmetic, 
step by step for 351 years, using a 
9-significant digit package. 


Calculated in floating point arithmetic, 
step by step for 351 years, using a 
lé-significant digit package. 


Using logarithms and the compound 
interest formula, le significant 
digit logarithms. 


Using logarithms and the compound 
interest formula, 6 significant 
digit logarithms. 


Using logarithms and the compound 
interest formula, 5 significant 
digit logarithms. 


Using logarithms, and the compound 
interest formula, 4 significant 
digit logarithms. 
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769509 .07 


763936 . 33 


7687 34 .195 


769267 .03 


769267 .02 


769267 .02 


769266 .00 


769050 .00 


780500 .00 


More results for the Manhattan Problem (1978) 


($24 at 3% for 351 years, compounded annually) 


, Incidentally, if the Indians could have commanded a more modern 
10%, instead of 3%, their bank account today would be about 8000 
times the current annual Gross National Product of the United States. 


OOO 
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TOVAY 15 TUcsUAY, MARCH 85, 1907. 
BOMBER WING SPAN 15 258 MILLIMcCTERS. 


ISSUE OVERTIME UHcCK FOR $35872,654.24 


$375400264 X 105751522 = 535897932384 o2645SS5ES2799~ 

OZTAGNOSTIC PaSScuU OK=-THE GUMPUTER WORKS PcRFEUTLY. 
OLTAMETcCR GF GONTROL ROUS SHOULD sc $1 FEET. 

VEHICLE 263 MADc 78 TRIPS MuY 159, 1973. 
SCCTAL SECURITY NUMBER IS 85061429. 

GASOLINE CONSUMPTION WAS -bbU3L18 MILES Per GALLON. 
GU »021,3885 LS A PRIME NUMBER. 

YOUR DIET REQUIRES 1264 LBe OF BUTTER PER UAY. 
RIVER TcMPERATURE IS 512.8 OtGrecsS Fe 


HOMc FHONE NUMBER IS 54076236. 


(Everything above is a reproduction of actual computer printout) 


Output Is A Function Of Input 


Cole 


